Conservation of Gene Cassettes among Diverse Viruses of the Human Gut

Authors

  • Samuel Minot
  • Gary D. Wu
  • James D. Lewis
  • Frederic D. Bushman
Abstract

Viruses are a crucial component of the human microbiome, but large population sizes, high sequence diversity, and high frequencies of novel genes have hindered genomic analysis by high-throughput sequencing. Here we investigate approaches to metagenomic assembly to probe genome structure in a sample of 5.6 Gb of gut viral DNA sequence from six individuals. Tests showed that a new pipeline based on de Bruijn graph assembly yielded longer contigs that were able to recruit more reads than the equivalent non-optimized, single-pass approach. To characterize gene content, the database of viral RefSeq proteins was compared to the assembled viral contigs, generating a bipartite graph with functional cassettes linking together viral contigs, which revealed a high degree of connectivity between diverse genomes involving multiple genes of the same functional class. In a second step, open reading frames were grouped by their co-occurrence on contigs in a database-independent manner, revealing conserved cassettes of co-oriented ORFs. These methods reveal that free-living bacteriophages, while usually dissimilar at the nucleotide level, often have significant similarity at the level of encoded amino acid motifs, gene order, and gene orientation. These findings thus connect contemporary metagenomic analysis with classical studies of bacteriophage genomic cassettes. Software is available at https://sourceforge.net/projects/optitdba/.
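The bipartite-graph idea in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: the function names, the `min_shared` threshold, and the contig and protein-family labels below are all hypothetical, chosen only to show how contigs might be linked through the protein families they share.

```python
from collections import defaultdict


def build_family_index(hits):
    """Map each protein family to the set of contigs that carry it.

    `hits` is an iterable of (contig_id, protein_family) pairs, e.g. the
    result of comparing reference proteins against assembled contigs.
    Together, the two sides of this mapping form a bipartite graph.
    """
    family_to_contigs = defaultdict(set)
    for contig, family in hits:
        family_to_contigs[family].add(contig)
    return family_to_contigs


def linked_contigs(family_to_contigs, min_shared=2):
    """Return contig pairs that share at least `min_shared` families.

    Keys are (contig_a, contig_b) tuples in sorted order; values are the
    sets of shared families. `min_shared` is an illustrative threshold.
    """
    shared = defaultdict(set)
    for family, contigs in family_to_contigs.items():
        ordered = sorted(contigs)
        for i, a in enumerate(ordered):
            for b in ordered[i + 1:]:
                shared[(a, b)].add(family)
    return {pair: fams for pair, fams in shared.items() if len(fams) >= min_shared}


# Hypothetical toy data: labels are invented for illustration only.
hits = [
    ("contig_1", "terminase"), ("contig_1", "portal"), ("contig_1", "capsid"),
    ("contig_2", "terminase"), ("contig_2", "portal"),
    ("contig_3", "integrase"), ("contig_3", "capsid"),
]
links = linked_contigs(build_family_index(hits), min_shared=2)
print(links)
```

In this toy input, `contig_1` and `contig_2` share two families and are linked, while `contig_1` and `contig_3` share only one and fall below the threshold. Requiring several shared families per link is one simple way to focus on multi-gene cassettes rather than single-gene matches.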

Related resources

Prevalence of Class 1 Integrons and Extended Spectrum Beta Lactamases among Multi-Drug Resistant Escherichia coli Isolates from North of Iran

Background: Extended spectrum beta lactamases (ESBLs) are an important cause of transferable multidrug resistance (MDR) in gram-negative bacteria. The most commonly described ESBL genes are generally found within integron-like structures as mobile genetic elements. The aim of this study was to determine the co-occurrence of class 1 integrons and ESBLs in MDR E. coli isolates. Methods: Susceptibility t...

Comparison of Vero and MDCK cell lines transfected with human siat7e gene for conversion to suspension culture

Introduction: Inactivated influenza vaccines are traditionally produced in embryonated chicken eggs, but the inability of this method to produce the required doses quickly enough during pandemic outbreaks has made the search for alternative modes of production necessary. The use of cell culture-based vaccine production is one way of overcoming the limitations of the egg-based method and securing a more rapid re...

Neuraminidase gene sequence analysis of avian influenza H9N2 viruses isolated from Iran

Influenza A viruses possess two virion surface glycoproteins, haemagglutinin (HA) and neuraminidase (NA). NA plays an important role in viral replication: it promotes virus release from infected cells and facilitates virus spread throughout the body. To detect any genomic changes that might have occurred in the NA gene of circulating avian influenza viruses, we have genetically analy...

Phylogenetic analysis of HSP70 gene of Aspergillus fumigatus reveals conservation intra-species and divergence inter-species

Aspergillus fumigatus is a saprophyte fungus, widely spread in a variety of ecological niches and the most prevalent aspergilli responsible for human and animal invasive aspergillosis. The first step to develop novel and efficient therapies is the identification and understanding of the key tolerance and virulence factors of pathogens. The main focus of the present study is to perform the similarit...

Evolution of viruses and cells: do we need a fourth domain of life to explain the origin of eukaryotes?

The recent discovery of diverse very large viruses, such as the mimivirus, has fostered a profusion of hypotheses positing that these viruses define a new domain of life together with the three cellular ones (Archaea, Bacteria and Eucarya). It has also been speculated that they have played a key role in the origin of eukaryotes as donors of important genes or even as the structures at the origi...

Journal title:

Volume 7, Issue

Pages -

Publication date: 2012